Estimating a Bivariate Density When There Are Extra Data on One or Both Components
نویسندگان
چکیده
Assume we have a dataset, Z say, from the joint distribution of random variables X and Y , and two further, independent datasets, X and Y , from the marginal distributions of X and Y , respectively. We wish to combine X , Y and Z, so as to construct an estimator of the joint density. This problem is readily solved in some parametric circumstances. For example, if the joint distribution were normal then we would combine data from X and Z to estimate the mean and variance of X ; proceed analogously to estimate the mean and variance of Y ; but use data from Z alone to estimate E(XY ). However, the problem is more difficult in a nonparametric setting. There we suggest a copula-based solution, which has potential benefits even when the marginal datasets X and Y are empty. For example, if the copula density is sufficiently smooth in the region where we wish to estimate it, then the effective dimension of the structure that links the marginal distributions is relatively low, and the joint density of X and Y can be estimated with a high degree of accuracy. Similar improvements in performance are available if the marginals are close to being independent. We suggest using wavelet estimators to approximate the copula density, which in cases of statistical interest can be unbounded along boundaries. Our techniques are also useful for solving recently-considered related problems, for example where the marginal distributions are determined by parametric models. Therefore the methodology has application beyond the context which motivated it. The methodology is also readily extended to more general multivariate settings.
منابع مشابه
A blended model for estimating of missing precipitation data (Case study of Tehran - Mehrabad station)
Meteorological stations usually contain some missing data for different reasons.There are several traditional methods for completing data, among them bivariate and multivariate linear and non-linear correlation analysis, double mass curve, ratio and difference methods, moving average and probability density functions are commonly used. In this paper a blended model comprising the bivariate expo...
متن کاملModel Selection for Mixture Models Using Perfect Sample
We have considered a perfect sample method for model selection of finite mixture models with either known (fixed) or unknown number of components which can be applied in the most general setting with assumptions on the relation between the rival models and the true distribution. It is, both, one or neither to be well-specified or mis-specified, they may be nested or non-nested. We consider mixt...
متن کاملEstimating a Function by Local Linear Regressionwhen
1 AMS 1991 subject classiications: primary 62G05; secondary 62J99. Abstract Automated bandwidth selection methods for nonparametric regression break down in the presence of correlated errors. While this problem has been previously studied in the context of kernel regression, the results to date have only been applicable to univariate observations following an equidistant design. This article ad...
متن کاملBivariate Density Estimation with an Application to Survival Analysis
A procedure for estimating a bivariate density based on data that may be censored is described After the data are transformed to the unit square the bivariate density is estimated using linear splines and their tensor products The combined procedure yields an estimate of the bivariate density on the original scale which may provide insight about the dependence structure The procedure can also b...
متن کاملA Statistical Method for Estimating Luminosity Functions using Truncated Data
The observational limitations of astronomical surveys lead to significant statistical inference challenges. One such challenge is the estimation of luminosity functions given redshift (z) and absolute magnitude (M) measurements from an irregularly truncated sample of objects. This is a bivariate density estimation problem; we develop here a statistically rigorous method which (1) does not assum...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005